To preserve the folder structure of the data, we have removed the .zip extension from Archive_dot_zip. To unzip it, first add .zip to the end of the file name. (If we did not do this, Dataverse would forcibly unzip the archive upon upload, which would remove the folder structure.)
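For users who prefer to script this step, the following is a minimal Python sketch of the rename-and-unzip procedure described above. It is an illustration, not part of the replication code: the function name restore_and_extract and the destination folder are hypothetical, and it assumes Archive_dot_zip has already been downloaded. It copies the file rather than renaming it, so the original download stays untouched.

```python
import shutil
import zipfile
from pathlib import Path


def restore_and_extract(archive: Path, dest: Path) -> Path:
    """Copy `archive` to a sibling file ending in .zip, then extract it into `dest`.

    Hypothetical helper for illustration; not part of the replication archive.
    """
    # Append ".zip" to the full file name (e.g. Archive_dot_zip -> Archive_dot_zip.zip).
    zipped = archive.with_name(archive.name + ".zip")
    shutil.copyfile(archive, zipped)

    # Extract every member, preserving the folder structure stored in the archive.
    with zipfile.ZipFile(zipped) as zf:
        zf.extractall(dest)
    return dest
```

For example, calling restore_and_extract(Path("Archive_dot_zip"), Path("replication")) from the download directory would extract the archive into a folder named “replication” (a placeholder name chosen here).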

The replication data is organized into four major folders: “representation results”, which replicates the results of the first half of the paper, through Figure 4; “survey experiments”, which replicates the results of Studies 1-3 of our survey experiments; “trump study”, which replicates Table 4, our application study on Donald Trump; and “si extremism results”, which replicates our analyses in SI-3.

In “representation results”, the scripts in “Cleaning Code” can be run in numerical order to create the data we use for our analysis. For a full replication, run “cces cleaning code for reference.do” first, using the CCES files available from the CCES Dataverse as inputs. We do not re-upload the CCES here because it is multiple gigabytes in size. Each script in the “Cleaning Code” folder contains a short description of its purpose at the top. “Cleaning Data” contains the data that “Cleaning Code” produces, which is saved in “Analysis Data.” “Analysis Code” then uses the data files that “Cleaning Code” has created to produce the Figures and Tables in the paper. The name of each file indicates the Figures/Tables that it replicates. Figures 1-4 are produced here, as well as Figures SI 1-4 and 11-18 and Table SI 7. In addition, “in line result on page 15 - how often voted with median.R” reproduces an in-line result from the paper.

In “survey experiments”, “Analysis Code” contains the scripts used in Studies 1-3 in the second half of the paper. This code analyzes the data in “Analysis Data.” We have also made “Survey Experiment Setup Code for Reference” available, which shows the process we used to clean the raw survey data and calculate the candidates shown in the vignettes.

In “trump study”, “Table 4.do” is a short .do file that analyzes “ideology_primaries_issues.dta” to produce the results in Table 4.

In “si extremism results”, “wave 1 descriptive statistics” contains code to replicate Figure SI-5 and “wave 1 and 2 over time opinion and vote choice analysis” contains code to replicate Figures SI-6 and SI-8.
